Sample Efficient Path Integral Control under Uncertainty
نویسندگان
چکیده
We present a data-driven stochastic optimal control framework that is derived using the path integral (PI) control approach. We find iterative control laws analytically without a priori policy parameterization based on probabilistic representation of the learned dynamics model. The proposed algorithm operates in a forward-backward sweep manner which differentiate it from other PI-related methods that perform forward sampling to find open-loop optimal controls. Our method uses significantly less sampled data to find analytic control laws compared to other approaches within the PI control family that rely on extensive sampling from given dynamics models or trials on physical systems in a model-free fashion. In addition, the learned controllers can be generalized to new tasks without re-sampling based on the compositionality theory for the linearly-solvable optimal control framework. We provide experimental results on three different systems and comparisons with state-of-the-art model-based methods to demonstrate the efficiency and generalizability of the proposed framework.
منابع مشابه
An Efficient Method for Selecting a Reliable Path under Uncertainty Conditions
In a network that has the potential to block some paths, choosing a reliable path, so that its survival probability is high, is an important and practical issue. The importance of this issue is very considerable in critical situations such as natural disasters, floods and earthquakes. In the case of the reliable path, survival or blocking of each arc on a network in critical situations is an un...
متن کاملPath Integral Control by Reproducing Kernel Hilbert Space Embedding
We present an embedding of stochastic optimal control problems, of the so called path integral form, into reproducing kernel Hilbert spaces. Using consistent, sample based estimates of the embedding leads to a model-free, non-parametric approach for calculation of an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant and task de...
متن کاملPath integral control and state-dependent feedback.
In this paper we address the problem of computing state-dependent feedback controls for path integral control problems. To this end we generalize the path integral control formula and utilize this to construct parametrized state-dependent feedback controllers. In addition, we show a relation between control and importance sampling: Better control, in terms of control cost, yields more efficient...
متن کاملLocalization aware sampling and connection strategies for incremental motion planning under uncertainty
We present efficient localization aware sampling and connection strategies for incremental sampling-based stochastic motion planners. For sampling, we introduce a new measure of localization ability of a sample, one that is independent of the path taken to reach the sample and depends only on the sensor measurement at the sample. Using this measure, our sampling strategy puts more samples in re...
متن کاملDesigning a Fractional Order PID Controller for a Two-Area Micro-Grid under Uncertainty of Parameters
Increasing the number of microgrids has raised the complexity and nonlinearity of the power system and conventional controllers do not present acceptable efficiency in a wide range of operation points. In this study, a fractional order proportional–integral–derivative controller optimized with hybrid grey wolf-pattern search algorithm is used to control the frequency of each area of the microgr...
متن کامل